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(54) Title: EARSET COMMUNICATION SYSTEM 

00 

Abstract: A syslem and method for providing wireless communication which is conlroUed by voice recognition software run- 
nmg on a contmller. The system includes an earset communicator and a Base Station that allows wireless communication between 
^ Uiese elements. The earsetxommumcalor rests comfortably on the user's ear and is held in place by an earhook. Tlie transceiver Base 

O Station commumcates with the earset communicator and connects to a host controller, such as personal computer (TC") or a house 
^ ?nr on T ir ^"l^ '''''' 'T"^^"" '"'^ coHuection or phone line. Voice commands are used for many functions 

^ for contiDiimg the system. The Base Station routes the earset microphone audio to the controller software for speech recognition 
^ and command processmg. Speech recognition software on the controller interprets the voice command and acts accordingly 
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TITLE: EARSET COMMUNICATION SYSTEM 

FIELD OF THE INVENTION 

The present invention relates to an earset communication system. The earset 

communication system includes a hands-free earset for use in Voice over Network (VoN) 
communication, voice dictation, control of a computer, and/or voice control of a number of 
additional functions (e.g., home entertainment and home automation). 

BACKGROUND 

Office communication products and systems have evolved significantly since the 
introduction of the telephone over 100 years ago. Today, one's home or office desk is 
firequently equipped with terminal devices such as computers, personal organizers, pagers and 
telephones allowing a user the abihty to communicate by sending email, facsimiles, letters, 
telephone voice calls and voice messages. The development of these communication 
technologies has focused on providing the user with a choice of mediums for communication 
between terminal devices. However, along with the advantages resulting from the 
development of these communication mediums, the disadvantages of interfiicing with these 
mediums has increased significantly. 

A user today often must choose from a number of alternative communication mediums 
through specialized terminal devices such as telephone for voice, facsimile machme for a 
facsimile transmission of text and nnages, and computers for text, email, images, video, video 
confermcing and voice. Combinations of mediums and devices are also available, such as 
voice over P, and data over phone Imes providing the user an even greater variety of 
communication options. For exan^le, using Voice over Internet Protocol (VoIP), Internet 
telephony may be combined with other modes of communication, such as video conferencing, 
and data or appUcation sharing, giving a user tremendous power to communicate with others, 
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worldwide, at a fiaction of the cost of conventional telephone systems. A disadvantage of 
typical VoIP systems, however, is that the user is tied to his or her computer when using the 
VoIP functionality 

In contrast to traditional circuit switched telephone networks that are limited to 
5 transmitting voice or data within the conventional voice bandwidth, telephone switchmg 
systems aie rapidly transitioning to packet-switched networks. In the packet-switched 
environment, information is transmitted over the network in short bursts of data, known as 
"packets." Packet-switched networks are generally more cost efi&dent than circuit switched 
networks because they require no caU set-up time (resulting in faster deUvery of traffic) and 
10 because users can efficiently share the same channel (resulting m lower cost). 

The transmission of voice over a communication network may be referred to herein as 
Voice over Network CVoN"). Voice over Internet Protocol ("VoIP") is used herein to refer to 
a specific fonn of VoN transmission: voice communication over packet-switched networks 
using the Internet Protocol. Today, VoIP is currently the most common implementation for 
15 VoN in the consumer market and is yet another selection available to a user for 

communicating more efficiently than ever before. However, increasing the number of 
communication mediums also increases the complexity of communicating because the user 
must decide which medium will be used and then interlace with the appropriate device. 

Although an office user has a wide variety of communication mediums to select from, 
20 the different systems typicaUy each require their own user interface, resulting in increased 
complexity and ergonomic problems. In addition, an office user's woiE space must also 
provide a substantial amount of space for myriad devices, including for example a telephone- 
speakeiphone, a computer keyboard, a mouse, a monitor, speakers and amicrophone, a 
camera for voice and video over IP ^Kcations. andperh^s a personal digital assistant 
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C*PDA"). Electrical connections are required for each product also creating messy cable 
nests. Additional office communication products may also be required or desired such as 
cellular telephones, pagers, printers, scanners, dictation machines and personal organizers, 
forther increasing ergonomic problems by increasing options presented to the user, and furllier 
reducing valuable desk space as well. 

The net effect of havmg multiple user-machine mterfaces may actuaUy result in 
reduced efficiency and productivity for the office user. As the number of communication 
devices increases, significant overlap in ftinctionaUty and in hardware occurs. For example, a 
conventional telephone-speaketphone is largely redundant hardware for users with cordless 
telephones, ceUular telephones and/or computer-based telephony devices. The telephone 
keypad and display is redundant with the computer keyboard and monitor since these 
functions may be combined to simpHfy the user-machine interface and reduce the required 
desk-top space. Existing devices also require the user to operate and maintain multiple 
terminal devices further contributing to ergonomic inefficiency. A typical modem office has 
poor ergonomics due to the incompatibility between these multiple devices and multiple 
interfeces, thereby requiring a user to learn to use and maiutain each of them effectively. 

Office communication device manufacturers have attempted to unprove the user- 
machine interfece by developing hands-free products and wireless communication systems in 
order to eliminate handsets and to promote freedom of motion. Although hands-free devices 
freed the user from having to hold a handset, these devices were limited to merely being 
extensions of a telj^jhone handset. A user still has to mamiaUy control the communication 
device whetiier it is a telephone, answering machine, fax machine or a computer for sending 
email. Conventional cordless telephones utihze an RFhnk to provide wireless 
communication between the handset and the base station. However, conventional cordless 
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telephones are limiteAto establishing a wireless link between the handset and the base station 
with manual control interfaces. ' 

Voice recognition systems were developed in order to convert speech into text based 
on the recognition of spoken words. For example, through the use of speech recognition 
5 software, a user does not have to use the computer keyboard in order to type text Speech 
maybe processed through a recognition algorithm resulting in the recognition of the word 
and the representation of the word as text or a computer display. These systems however have 
becai largely limited to word processing plications. 

Remote speaker and microphone systems are known in which a transceiver located in 
10 aheadsetiscapableofestabUshingalinkwithaportabletelephone. Such systems however, 
have several limitations. As an example. U.S. Patent No. 5,590,417, issued to Rydbeck, 
describes a wireless headset InU.S. Patent No. 5,590,417, the contents of which are 
incorporated herein by reference, the wireless headset is worn on the user's head and receives 
and transmits a voice conversation to a portable telephone. One significant disadvantage of 
15 such a system is that the system cannot control functions such as dialing or searching for a 
telephone number without afBrmative manual interface with the user. SpecificaUy, a user still 
has to manually enter the phone number and initiate the call, usuaUy by pressing a "Send" 
button. This has the disadvantage of distracting the user who maybe operating a vehicle, PC, 
or another conomunicaticn device. 
20 A further disadvantage of such systems is thata user must manually mute the 

microphone or remove the headset in order to switch from speaking^on the telephone to 
communicatmg on another device or speaking to others in the office without being overheard. 
The lack of the abiUty to command and/or control the communication device without manual 
intervention, therefore, limits tiie speed and efficiency of a user. 
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The communication and control challenges discussed above wilii reference to the 
home user may apply with equal force to the corporate setting. In the corporate office 
environment, the desirability of hands-free fbnctionaUty has been demonstrated by the 
proliferation of headset products. A disadvantage of the known products, however, is that 
they still require a manual interface for even basic communication tasks like telephony. For 
example, with known headsets, to place a call, the user typically has to manuafly take a 
telephone off-hook and dial a number or similarly input commands through a computer 
keyboard or mouse. It would be desirable to free the user from the requirements of such 
manual interfaces. 

Likewise, home automation applications are known for use with a personal computer. 
Again, however, the user is limited in Ihe sense that the user must still use traditional 
computer interfaces, such as a keyboard or mouse to input commands. It would be desirable 
to provide the user with the freedom to control home automation functions without any type 
of manual interface with the computer. 

No known system has successfully integrated the desired features into a system that 
provides a simple, intuitive control mechanism of the user's communication devices and 
medmms while also eKminating the need for redundant hardware and the requirements of 
numerous manual interfaces. It would therefore be desirable to have an improved 
communication system. 
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SUMMARY OF THE INVENTION 
An object of the invention is to improve the efficiency and productivity of a home or 
office user. Productivity is improved by replacing existing conventional user-machine 
interfaces with a single convenient user-machine interfece that is transparent to the user and 
5 responsivepredominatelyto voice commands. By eliminating multiple inefficient 

conventional user-machine interfeces. the invention improves Foductivity by consoUdatmg 
the fimctions of numerous communications interfeces to a single, hands-free, wireless 
communications interface. This also provides the advantage of eliminating Haining of the 
user to operate these various devices. 
10 In accordance with a first aspect of the present invention, the user is provided with a 

hands-free, wireless earset that operates as a communication interface. The ear^et may 
provide control and/or commraiication functionaUty in accordance with voice cormnands 
issued by the user. For example, by way of the eaiset, VoN communication is enabled 
without requiring a manual control interfece. For one embodiment, VoN communication 
15 includes VoIP. 

Another aspect of the invention is to provide an eaiset communication system that 
provides control fimctionality using speech recognition software rmming on a microprocessor- 
based appliance. In one embodiment, the earset is coupled by air interface to a base station, 
which is capable of comiecting to the microprocessor-based appUance (e.g., a PC, handheld 
20 computer, PDA, set top box, cable modem, and the like). In another embodhnent, tiie base 
station connects directly to a PC (personal computer) and uses sofhi^are ruming on the PC. 
Advantageously, the earset has. the potential to control networic fimctions, such as Intemet ^ 
connectivity, home entertainment functions (such as home TV, DVD, audio and'or video 
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systems and the like), boms automation functions and the like, when connected to liie 
microprocessor-based {^liance. 

The system includes an earset communicator and a base station that preferably allows 
wireless communication between these elements. The earset communicator allows hands-free 
5 and wireless operation of the communication system, thereby completely freeing the user 
from being confined to the desktop. Has base station operates with a voice recognition and 
control programs in a controUer, giving the user simple, fast and complete control of every 
communication c^abiUty through the controller, including for example conlrol of telephony 
and data. Therefore, one embodiment of ttie invention combines the commimication power 
10 and flexibility of a confroUer-communication system with control functionality via simple 
voice commands. 

In accordance with another preferred embodiment, the earset communication system 
may be used for communication via Internet telephony or VoIP, voice browsing of the 
Internet, voice dialing and control management, voice dictation, PSTN telephony, and/or 
15 home control functions. The system allows the user to access files, review paperwork, work 
on the computer, and handle other office or home related activities without being tied to the 
desk because the earset has no tethering wires. 

In accordance with another aspect of the invention, the eaiset is a Ughtweight battery 
powered device having a noise cancelmg microphone or microphone array. This earset has an 
20 advantage over existing headsets with long boom microphones because the microphone is 
located outside the user's field of vision so that the user can work without distraction, 
converse face-to-face, and even drink a beverage while using tiie phone or PC without having 
to move or remove the earset. The eaiset preferably aUows the user to converse on a caU, 
command other electionic devices, use both hands to type or perform other fimctions, get up 
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and move, around the entire home or smaU office, all without the need to detach wires, remove 
the earset or carry around a cordless telephone. 

In accordance with another aspect of the invention, the system automaticaUy dials 
stored telephone numbers based on a voice command. Thus, no phone number need to be 
5 remembered by the user, and no digits are required to be manually dialed. 

In accordance with yet another embodiment, the earset communicator interface 
provides tremendous advantages while in an automobile or involved in other activity that 
requires the use of both hands. Ausermay operate a mobile telephone, computer or other 
peripheral device in a hands-free mode. In one embodiment of the earset communication 
10 system, flie earset device communicates with a PDA, which is in turn connected to a network 
that is capable of supporting voice communication. Another embodiment of the earset 
communicator system utilizes the base station to communicate with both the earset 
communicator and the wireless telephone network. 

In accordance with yet another preferred embodiment, voice commands are used for 
15 aU control functions. Voice recognition software aUows the user to interact with, for example, 
a conq>uter via spoken commands to initiate a VoIP call. The system uses voice recognition 
for control functions such as placing phone calls and answering phone calls. Voice 
recognition software may also be utilized in conjunction wilh commands received through the 
earset to perform othqr functions, such as checking schedules and appointments, controlling 
20 functions for audio, video, lighting, HVAC,(Heating Ventilation Air Conditioning), motorized 
windows and doors, etc., voice browsing of the Internet, voice dictq^oii; and integration with 
existing 3"* party software to create unique vertical applications. 

The earset preferably rests comfortably on the user's ear and is held in place by an 
earhook. A transceiver base station communicates with the earset via a wireless link. In 
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accordance with a preferred embodiment, the earset communicator is extremely Ughtweight 
(approximately 28 grams, or 1 ounce) so that it may comfortably be supported entirely by the 
user's ear. without the need for an over-the-head band. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The preferred embodiments of the present invention are iUustrated by way of example, 
and not limitation, in the figures of the accompanying drawings in which: 

Figure 2 illustrates the earset; 

Figure 3 illustrates the inner side of the earset; 

Figures 4A & 4B illustrate two rear-view embodiments of the earset; 

Figure 5A is a functional block diagram of one embodiment of the earset 
conmiunicatiou system that illustrates audio flow information and Figure 5B further illusttates 
a plurality of types of the network interfece shown m Figure 5A; 

Figure 6 is a functional block diagram illustrating a VoN audio interface in the 
microprocessor-based appliance ^own in Figures 5A and 5B; 

Figure 7A illustrates a block diagram of a preferred embodiment of the earset 
communication system including hardware and software system components; 

Figure 7B Ulustrates an altemative embodiment of the earset communicatiwi system in 
which the network interface is incorporated into the base station; 

Figure 8A iUustrates the analog-to-digital and digital-to-analog conversions m the base 
station portion of Figure 7A and Figure 8B illustrates an altemative embodiment of that 
portion of the base station eliminating die analog portion of the path; 

Figure 9 illustrates generaKzed software flow for handling the issuance of a command 
by the user; 

Figure 10 illustrates the software flow for making a call; 

Figure 1 1 A illustrates the software flow for making a caU where the user provides the 
called party's name and location; 
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Figure 1 IB iUustrates ttie software flow for makmg a caU where the user provides only 
the called party's name; 

Figure 1 1 C illustratss the software flow for requesting the voice agen^ 
Figure 12 illustrates the software flow for retrieving schedule information 
Figure 13 illustrates the software flow for control of home entertainment fimctians. 
Figure 14 ilhistrates the generalized software flow diagram for home automation 
ftmctions. 

Figure 15 iUustrates a system for utilizing the eaiset and hase station in a Voice over 
Network in^lementatian. 



DETAILED DESCRffTION OF TBQE PRESENTLY PREFERRED 
EMBODIMENTS 



An earset communication system and method of using such a system are described 
with reference to the figures mtroduced above. As shown in Figure 1, the earset 
communication system includes three main components: a wearable transceiver, hereinafter 
referred to as the earset or earset communicator 10; atransceiver base station 20 having an 
interface to a microprocessor-based appUance 30; and a microprocessor-based appliance 30. 
The microprocessor-based appUance 30 preferably includes at least one network mterfece. 
such as a network card or modem for access to the Internet or a corporate network and a 
PSTN telephony interface, for example a voice modem or similar device for access to the 
PSTN. Such similar devices for access to the PSTN include, for example, the PhoneRider by 
MediaPhonics or the Intemet Line Jack by QuickNet. As describedfiirther below, the 
network mterface may alternatively be mcorporated into the base station 20. 

The microprocessor-based appliance 30 preferably utilizes voice recognition software 
and communication software modules to interface with a communication medium. The 
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microprocessoi'-based'^Uance 30 may be, for example, apersonal computer, a server, a 
PDA, a set top box, a cable modem, a handheld computer, or a web browsing kiosk. Media 
devices and other household controllers often are processor controlled, and therefore are 
enable of being integrated into the earset communication system. The microprocessor-based 
appUance 30 may utilize any type of computer architecture including conventional 
microprocessors and neural networked processors. 

As described further below, the network interface 300 provided by the microprocessor- 
based appliance 30 couples the base station 20 to a network capable of supporting voice (e.g.. 
the fiitemet, corporate intranets, corporate networks, the PSTN and the like). In accordance 
with a preferred embodiment, the network is a packet-svwtched network that supports VoIP, 
Voice over ATM, Voice over Frame Relay, Voice over cable. Voice over DSL, and Ae like. 
The network may be a wired network, a wireless network, or a combination of die foregoing. 
The network may be a local area network (LAN), but for communication applications will 
more typically be a wide area network (WAN), a combination of WANs, the Internet or the 
PSTN. 

The microprocessor-based appliance 30 includes a read-only memory (ROM) 
structure, a random access memory (RAM) stmcture, associated data and address buses, and a 
port for coupling the microprocessor-based appUance 30 to the base station 20. Jn accordance 
wilb a preferred embodiment, the port that couples the microprocessor-based appUance 30 to 
the base station 20 is a Universal Serial Bus ("USB") port. Other types of wired or wireless 
connection may alternatively be used. In addition, the microprocessor-based appUance 30 is 
preferably a personal computer. Those skiUed in the art of communications will recognize, 
however, that the microprocessor-based appUance 30 may alternatively be a handheld 
conq)uter, a PDA, a set top box, a server, a cable modem, a web browsing kiosk or the Uke. 
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As used hesrein, the pl^e wd) browsing kiosk refers to an appliance, which mcludes the 
microprocessor-based appUance 30 structure recited above, or equivalents thereto, that is 
specifically adapted for browsing the Intemet. 

TheEarset 

The earset 10 preferably includes an audio transducer 53, a speaker 52 and a 
microphone 50. as shown in Figures 7A and 7B. The audio transducer 53 may be used for 
ringing or other similar paging or notice type functions. Preferably the audio transducer 53 is 
enable of generating a tone that is loud enough to notify the user of an incoming call, page or 
the like. Alternatively, the speaker 52 may provide the notice-type functions of the audio 
transducer 53, although this is less preferable because volume limitations on the speaker 52 

may prevent the user fiom hearing the ringing orpaging tone when the earsetlOis not present 
on the users' ear. In accordance with an alternative embodiment, the user may hear audio 
from a speakerphone (not shown) instead of an audio transducer 53 or speaker 52. 

Figures 2 and 3 illustrate an exemplary form of tiie earset 10. The earset 10 is 
designed to be worn comfortably on the user's ear. As illustrated in Figure 3. the speaker 52 
ejrtends from the earset 10 and is configured to be inserted into the user's ear. The speaker 
may be surrounded by gel and/or foam to improve comfort and fit of the earset 10. 
Alternatively, the earset. 10 may be carried by the user. 

UnUke a headset, the earset 10 is preferably extremely Ughtwdght (approximately 30 
grams, or 1 ounce) so that it may comfortably be supported entirely by the user's ear. The 
earset 10 is supported upon the user's ear by an earhook 14, as shovm itf Figures 2 and 3. The 
eaAook 14 not only stabilizes the earset 10 on the user's head when worn on the ear but also 
orients the microphone 50 for reception of commands spoken by the user. This earhook 14 
may be comiected to the earset device 10 via a thermal plastic ring which has notched detents 
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forrepeatable positioning. The earhook 14 maybe made out of aplastic or flexible wire so it 
can mold to fit each ear comfortably. When the earset 10 is not worn on the ear, a Ughtweight 
eaihook/speaker is plugged into a 2.5 mm jack which is located between the optional battery 
charging and parking contacts which are shown in Figures 4A and 4B. 
5 As shown in Figures 2 and 3, the microphone 50 is mounted in a cavity at an end of 

the earset 10 that is distal from the earhook 14. For flie embodiment shown in Figures 2 and 
3, the microphone 50 is housed in an adjustable mini-boom. The microphone 50 housing is 
preferably acoustically insulated to minimize couphng of unwanted mechanical noise. The 
microphone signal Kne is preferably electricaUy shielded to prevent the couplmg of uawanted 

10 RF energy. The use of the mini-boom, or equivalently— the extension of the length of the 
earset toward the hp plane, is required for the high signal-to-noise ratio demanded by 
currently available voice recognition software. From the standpoint of the user, and for 
simpHfication of the mechanical design of the earset 10, it would be preferable to eliminate 
the mini-boom and to instead simply mount the microphone 50 directly to the earset at a 

15 greater distance from the Up plane. It is envisioned that, as speech recognition software 
inq>roves and the noise background therefore becomes less pertinent, the mini-boom may be 
ehminated from the earset Other noise cancellation techniques known to those skiUed in the 
art, such as the use of a noise canceUng microphone an:ay, may be used as an alternative to the 
mini-boom, or in conjunction with Ihe nuni-boom to enhance audio quality. 

20 The microphone 50 is preferably a miniature, passive noise cancehng electret element 

with a cardioid response pattern. The mmi-boom is pivotally attadwd.to the body of the 
earset 10 to allow flie mini-boom to pivot away from the major axis of the earset 10. 
Preferably, the mini-boom may pivot vcp to approximately 20" away from the m^or axis. 
When the earset 10 is worn by the user, the end of the mini-boom locates microphone to the 



14 



wo 01/78443 PCT/DS0,1/J 1069 

side of flie usee's moutli and ^^proximately even wilh the lip plane \^dbile keeping the 
microphone out of the puff stream. 

In alternative embodiments, the mini-boom is eliminated and the single microphone 
50 and mini-boom are replaced by a microphone array with an associated DSP system that is 
programmed to reduce background noise and echoes. It is also envisioned that speech 
recognition software wiU in the future progress to the point where the noise cancellation 
techniques described above are not required. The obverse of the microphone 50 may be 
ported to enhance passive noise cancellatioa Either active or passive noise cancellation 
techniques may be used. For example, an array of microphones may be used with a adaptive 
combiner to select a weighted group of microphone signals to provide the lowest noise and 
therefore the highest signal to noise ratio. 

The speaker bud 1 14 shown in Figure 3 preferably extends from the body of the earset 
10 and is covered by an acousticaUy permeable foam cap, which acts as a cushion to prevent 
the convex covering of flie speaker bud 1 14 from icritating the ear. The speaker 52 is 
optimally capable of reproducing sound in the voice audio frequency band. The convex shape 
aUows it to self-seat, centering upon the ear canal (in the Concha), with minimal to no 
adjustment, when placing the earset 1 0 upon the ear. 

The earset 10 may be powered by a Ughtweight rechargeable battery 54, such as a 
Lithium-Ion Polymer battery. Other types of rechargeable batteries may alternatively be used. 
Without limiting the invention, a battety having the following characteristics is acceptable for 
the present appUcation, althougji other batteries may alternatively be used. The weight of the 
. battery may be approximately 7 grams or less. The dimensions may be approximately a width 
of 20 mm by length of 50 mm by a depfli of 5 mm. The battery may have an approximate 
capacity of 250 mAH or more, and be capable of powering the earset for more than 2 hours. 
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The apptoTdmate battery voltage may be ftom 3.3 V to 4.1 V with an ^proximate nominal 
voltage of 3.8 V. Future improvements in battery performance including increased volumetric 
energy density and increased gravametric energy density may also be utilized. The battery 54 
may be encased in a plastic pack that is mounted on the side of the earset 10 from the back as 
5 shown in Figures 4A and 4B. Preferably, the earset 10 includes battery charging/power 
contacts that are connected to the battery pack internally, i.e. through the earset. and the base 
station 20 includes mating contacts for charging the battery when the earset 10 is not is use. 
Alternatively, the battery may be removed fixmi the earset 10 for charging, such as in a 
charging stand that may be incorporated into the base station 20. 

10 The battery 54 is preferably located as close to the ear as possible to keep the center of 

gravity of the eaiset 10 nearest the center of the ear, and to'be positioned to balance the earset 
10. For power management purposes, the earset communicator 10 may normally be in a 
"sleep," or inactive, status in which most of its systems and components are powered down. 
In accordance with one preferred embodiment, the earset 10 also includes a set of 

15 parking contacts as illustrated in the alternative embodiment ofFigures4A and 4B. "When the 
earset 10 is in contact with the base station 20, and the parking contacts engage mating 
contacts on the base station 20, an identification code, which is commonly associated with a 
radio transceiver chipset within the earset 10, is sent by the eaiset 10 to the base station 20. In 
this manner, the base station 20 becomes associated with a particular earset 10. In other 

20 words, the base station 20 wiU communicate with the proper earset 10 even in an environment 
in which numerous earsets 10 are transmitting command signals. 

For those situations where the user does not wish to wear the earset 10 on their ear, the 
earset may be provided with a sq)arate speaker/microphone which can be plugged into an 
optional 2.5mm jack at the rear of the earset, as shown in Figure 4A. When the earset 10 is 
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ms«5rted into flie jadcthe audio is div«ted &om the internal microphone 50 and speaker 52 to 
a connected wired speaker/microplione or speakeiphone. Using a special cUp that may attach 
for example to the speaker bud, the user may then attach the earset to Ms or her shirt, or wear 
it with a lanyard around their neck. Since the wired microphone/speaker typically weighs 
only 1/8 of an ounce (3.5 grams), this may be a more comfortable arrangCTient for some users. 

Figure 7A illustrates ablodc diagram of apreferred embodiment 6f the earset 
communication system. Figure 7B illustrates an alternative embodiment. The system uses an 
RF link 180 to provide hands-free operation between a self-contained compact earset 10 and a 
base station 20, which has interfaces to a microprocessor-based i^jpliance 30 and a 
communication network 300. The earset communicator 10 comprises a radio frequency 
transceiver system 62, 60 for wireless radio frequency between the eaiset 10 and the base 
station 20. The radio transceiver 60 is preferably a 900 MHz Digital Spread Spectrum 
Transceiver Model No. RF105, which is commercially available from.Conexant Systems, 
Incorporated of Newport Beach, CA. This chipset, for example, will automatically select one 
of 40 available channels. By selecting the channel with the least interference and by utilizing 
DSS (Digital Spread Spectrum) technology, the system is interference tolerant. Radio 
transceiver 60 also preferably includes a Conexant 900 MHz Class AB KF Power Amplifier 
Model No. RF106 which provides a communicating range of approximately 250 feet (76 
meters). The earset Codec 58 is preferably a HummingJ)ird 100-pin ASIC + CODEC (single 
chip) Model No. RSST7504 or equivalent The base station Audio Processor 272 is 
preferably a 144 pin Hummingbird ASIC Model number RSST7504, and the base station 
CODEC 224 is preferably a 32 pin Hummingbird CODEC Model number 20415. The RF 
antenna 56 may reside within the plastic enclosure of the earset 10 provided the antemia 56 
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meets the minimum fractional wavelength requirements of the transmit frequency. The 
antenna 56 may be positioned along the outer edge of the plastic earset case. 

Alternatively, transceivers 62, 60 may each be a 2.4 GHz spread spectrum transceiver 
system such as is available from Siemens Electronics, or a 900 MHz chipset such as offered 
by Rockwell/Conexant (as previously discussed), or an Ericsson bluetooth chipset, Model No. 
PBA 313 m, or any other chipset that supports wireless communication. Typically these 
chipsets are based on a foil diq)lex analog, CDMA or TDMA technology formats. Chipsets 
from other manufecturers may alternatively be used, provided their air interface specifications 
provide high quality voice and security. One skiUed in the art is capable of identifying 
commercially available components for the air mterface in the system and would also 
recognize other substitute chipsets. An advantage provided by the 900 MHz and 2.4 GHz 
chipsets, however, is that they provide the earset 10 with a substantially longer usable lange 
than is available from known headset arrangements. 

The output of the earset radio receiver 60 is connected through the ASIC 108 to an 
amplifier in the CODEC 58 where the output portion of the audio circuit will drive the 
speaker 52. The output level of the signal sent to the earset speaker 52 is controlled digitally 
by the Hummingbird chip 108. 

A tone may be emitted fix)m an internal audio taansducer 53 to alert the user of a low 
battery state. In addition, an out of range tone may optionally be emitted by the internal audio 
transducer 53 when the earset 10 is not withm the recognizable range of the base station 20. 
When the earset 10 cannot sense the base station 20, the earset 10 preferably emits a specific 
tone, for example, periodically every 10 seconds. The eaiset 10 will emit a repeating ringing 
tone, preferably via the audio transducer 53, to notify the user of aa incoming caU. When the 
voice agent needs to present the user with a call notification, the microprocessor-based 



18 



wo 01/78443 PCT/USft.1/11069 

appliance 30 may send a signal to the base station 20, which in turn relays the signal to the 
eaiset 10 to begin the ringing tone. The user preferably may locate the earset 10 by activating 
a paging signal fiom the computer 30, or the base station 20 for the optional case in which the 
base station 20 includes a button for sending the paging signal. The earset 10 may emit a 
repeating paging tone cadence to allow the usa- to locate the earset 10. 

The earset communicator 10 contains controls that aUow the user to switch the earset 
10 to an "on" or active state when use of the earset functions is desired or necessary, such as 
when answering an incoming telephone call. A single button, i.e. a command button 1 10, on 
the earset communicator 10 prompts the microprocessor-based q)pliance 30 that a voice 
command is imminent As described further below, the user preferably receives, in response 
to the user depressing the command button 1 10, a configurable ready prompt through the 
earset intemal audio transducer 53 from the microprocessor-based appliance 30. The ready 
prompt notifies the user that the system is preferably ready to recdve a voice command. The 
ready prompt is stored on the microprocessor-based ^pliance 30 for example in a digital 
sound file format that allows the user to configure or record customized prompts. The earset 
intemal audio transducer 53 may also be used to notify the user of system status such as 
incoming phone calls, low battery status, paging signals, and "out of range" warnings. 

The Base Station 

The base station 20 is the communications gateway between the microprocessor-based 
appUance 30 and the earset 10 in the earset communication system. Reference may be made 
to Figures 7A and 7B for block diagrams of the base station 20, whgreifi thepreferred 
embodiment is illustrated in Figure 7A and an altemative embodiment is shown in Figure 7B. 
The base station 20 contains circuitry necessary to operate the earset 10. The base station 20 
footprint is preferably small relative to a desktop. In accordance with a preferred 
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embodiment, tfee base station 20 is smaU enough to be conveniently used while traveling, such 
as with a laptop computer. An internal KF antenna 22 may be used in order to provide a more 
aesthetically pleasing ^pearance, however, an external antenna 22 may alternatively be used. 
Antemia diversity may be utilized to increase signal to noise ratio and decrease RF 
interference. 

In accordance wiHx aprefetred embodiment, the transceiver base station 20 provides a 
USB interface 21 to the microprocessor-based appUance 30, having an associated memory 
structure. As previously noted, the microprocessor-based appUance 30 may be apasonal 
computer ('TO, PDA (personal data assistant), or other microprocessor-based device such as 
a set top box, cable modem, or other Internet device/appUance, or home control/automation 
system or other Internet services device. Other types of iaterfaces to the microprocessor- 
based appUance 30, such as RS-232, PCMCIA, Bluetooth or infrared, may alternatively be 
used. 

Figure 8A illustrates a portion of ttie base station 20 hardware from Figure 7A and 
iUustrates the fonn of the voice signal between the USB interface 21 and the Hummingbird 
ASIC 272. As shown in Figure 8A, flie voice signal is digital, such as 16 bit, SkHz Unear 
PCM data, between the ASIC 272 and the CODEC 224, which then converts voice signals 
from the ASIC 272 into analog form. The voice signal is then digitized by the CODEC 282 
and passed to the USB mterface21. The opposite conversions are made for signals traveUng 
from the USB interface to the Hummingbird ASIC 272. Hie intermediate conversion to 
analog form allows the Hmnmingbird ASIC 272 and the USB inte^e" 21 to operate using 
independent clocks. In an alternative embodiment in which the Hummingbird ASIC and the 
USB interface 21 operate on synchronized clocks, the intermediate conversion may be 
eliminated as shown in Figure 8B. 
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Preferably, % base station 20 draws power entirely from the USB connection 21 to 
the computer 30. Alternatively, the base station 20 may be powered from a DC power adapter 
connected to an AC power source, commonly known to those skUled in the art. This 
alternative power source may be required where flie base station 20 provides battery charging 
capability as noted above. The base station 20 may be a standalone unit, or may attach 
directty to the microprocessor-based appUance 30. For example, where the microprocessor- 
based appUance 30 is a laptop computer, it may be desirable to momit tiie base station 20 to 
the laptop for ease of use during transit For example, this permits the user to use the system 
for voice dictation while traveling. 

As an alternative, tiie base station 20 may be incorporated into the microprocessor- 
based ^liance 30, either by physicaUy incorporating the base station 20 hardware into the 

^pUance 30 form factor or, where the appliance 30 is akeady capable of supporting a 
wireless connection to the earset, by programming flie appliance 30 to perform the base 
station 20 fimctions. For exan5)le, it is envisioned that personal computers, PDAs, ceUuiar 
telephones and the like will include transceivers that support communication in accordance 
with the Bluetooth protocol. Those skiUed in the art would be capable upon reviewing this 
document of adapting tiie earset 10 to interface witii such appliances 30. 

The base station 20 provides an interference-resistant, secure KF link for multiple 
earsets. In one embodiment, tiie system may support up to 8 earsets. If multiple earsets 10 
are communicating simultaneously, tiiey act as "Conference Call" units, working in tiie same 
manna: as multiple wired telephones on a single line. The earset to base station range is 
preferably in excess of 75 meters in tiie presence of interference from sfructures such as walls 
and ceilings. The signal between earset 10 and base station 20 is preferably capable of 
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passing Ihrough a minimum of six standard wood stud and drywaU walls, wHch are typical of 
residential construction. 

The earset 10 has the abiHty to associate itself with a specific base station 20 when in 
the presence of multiple base stations within the reception area. For example, as described 
above, the earset 10 may include parking contacts that, as is known in the art of cordless 
telephones, allow the earset 10 and base station 20 to be logically mated. In the same manner, 
the base station 20 and earset 10 may be set up to use a particular encryption technology. 

One skilled in the art can readily implement such a system based on the air interface 
standards used in the radio transceiver chipset for the air interface 180. For example, the 
manufacturers of 2.4 GHz or 900 MHz digital spread spectrum chipsets associate a p jl 
(pseudo random) code for those chipsets based on CDMA technology and fliese chipsets are 
readily utiUzed in this system. This capability wiU allow multiple earsets or earset systems to 
function simultaneously. Because the earsels lOm^be logically mated with a base station 
20. flie system aUows many earsets 10 be associated with a single base station 20, or 
alternatively allows numerous, earset 10^ase station 20 pairs to be operated within the same 
area. 

Voice over Network Communication 
A preferred embodiment of the present invention provides advantageous use of the 
earset 10 with Voice over Network (voice over IP, voice over ATM, voice over Frame Relay, 
voice over cable, voice over DSL, and the like) technology. In Figure 7A. the 
microprocessor-based appHance 30 mcludes a network intaface 30Q that is accessible to the 
earset communication system via the software shown. In the altemative embodiment of 
Figure 7B, the base station 20 includes a network mterface 300, which may be a DAA or 
"Data Access Arrangement" where the mterface is to tiie PSTN. Tlie network interface 300 
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may be a conn^tion which couples the appKance 30 (m the case of Figure 7A) or the base 
station 20 fin the case of Figure 7B) to a conmninication hnk such as a data service, Internet 
service, cable modem type service, or a conventional telephone netwo± interface (also 
referred to as the "TelCo") 25 . For example, the network interface 300 may connect directly 
to an Intemet data service in order to provide VoN fimctionaUty m a consumer or home office 
environment. In a corporate appUcation, the network interfece 300 may connect to a LAN, 
WAN or coiporate network. 

Figure 15 iUustrates a block diagram ofan embodiment of the present mvention for 
using the earset 10 in conjunction with VoIP software to make an Internet-based 310 Voff 
call. The earset 10 may also be used within a coiporate telecommunications enterprise 390 to 
make voice over network calls when integrated with a coiporate VoIP (or any VoN) platfoim 
such as those offered by 3Com Coq)oration, Cisco Systems and others. 

Following are three exemplary scenarios describing the use of the earset 10 in 
conjunction with VoP software. In the first scenario, as illustrated in Figure 15, VoIP calls 
are made between the earset 10 and microprocessor-based appliance 30'. 

1. Microprocessor-based appliance 30 is connected to the IP network (Ihtemet) 310. 

2. User speaks into the earset 10. 

3. Voice is transmitted over the air interface 180 in a transmission to the base station 20. 

4. Voice is transmitted (digital) via USB 21 to the microprocessor-based appliance 30. 

5. Voice is transmitted to the IP cHent (software), as shown in Figure 6, on the 
microprocessor-based appliance 30. 

6. Voice is converted into IP packets and transmitted through the network interface 36, 
shown in Figure 6, to the microprocessor-based appliance 30' via the Litemet 310. 
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Note that microprocessor-based appliance 30 to microprocessor-based appUance 30' 
VoIP commumcations do not require a VoIP gateway service provider 320. There are a 
number of software packages (including the Mtemet Phone cUent software offered by 
ArialPhone LLC of Vernon ffiUs, Illinois, as well as Microsoft NetMeeting, Internet Phone by 
VocalTec, Cu-Cme and the like) that can be purchased or downloaded ftom the Intemet that 
allow users to talk to each other using their microprocessor-based appttances 30 & 30' and 
VoIP. 

In the second scenario, illustrated in Figure 15, VoIP calls are made between flie earset 
10 and telephone 380 or Corporate desktop equipment 390 via Centrex Service. 

1. Microprocessor-based appUance 30 is connected to the ff network (Intemet) 310. 

2. Userspeaks into the earset 10. 

3. Voice is transmitted over the air interface 180 in a transmission to the base station 20. 

4. Voice is transmitted (digital) via USB 21 to the microprocessor-based appliance 30. 

5. Voice is transmitted to the IP client (software), as shown in Figure 6, on the 
microprocessor-based appliance 30. 

6. Voice is converted into ff packets and transmitted through the network interface 36, 
shown in Figure 6, to an IP Gateway 320 via the Intemet 310. 

7. The IP Gateway 320, in this scenario typically part of the telephone company central 
office, converts the IP voice packets to analog and forwards the packets to the Central 
Office switch 330. 

8. Central Office switch 330 transmits analog voice to analog telgjhone 380 or to 
Corporate desktop equipment 390 via Centrex.. 

In the third scenario, iUustrated in Figure 15, VoIP caDs are made between the earset 
10 and Corporate desktop equipment 390 via telephone company Central Office switch 330. 
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1 . Mioropiocessdr-based appliance 30 is connected to the IP network (Ihtemet) 310. 

2. User speaks into the earset 10, 

3. Voice is transmitted over the air interface 180 in a transmission to the base station 20. 

4. Voice is transmitted (digital) via USB 21 to the microprocessor-based appUance 30. 

5. Voice is transmitted to the IP chent (software), as shown in Figure 6, on the 
microprocessor-based appliance 30. 

6. Voice is converted into ff packets and transmitted through the network interface 36, 
shown in Figure 6, to an IP Gateway 320 via the Internet 310. 

7. The IP Gateway 320, in this scenario typically part of the telephone company central 
office, converts the IP voice packets to analog and forwards the packets to the Central 
Office switch 330. 

8. Central Office 330 transmits to corporate PBX 370, or IP PBX 360 (in this case, there 
is an IP Gateway 350 between the CO (central office) 330 and the IP PBX 360 to 
convert analog voice into IP Packets). 

9. PBX 370 or IP PBX 360 transmits voice to the corporate telecommunications network 
390. 

As an altemative to the use of a telephone company Central Office switch 330 in scenario 
three, the IP packets may be routed directiy to an IP PBX 340 and deUvered in IP form to the 
corporate desktop equipment 390. Those skilled in the ait wiU recognize that the path in 
Figure 15 that will be used for communication with the corporate desktop equipment 390 in 
any particular case is dependent upoa the corporate desktop equipirienr390 hardware. 

A method is described below with reference to Figure 6 for interaction between the 
earset 10 and the microprocessor-based appUance 30 to make VoIP calls. It should be 
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recognized tha^ the same method appUes to other VoN protocols simply by replacing the IP 
client with an appropriate client that supports the desired protocol. 
When the earset 10 user is speaking: 

1 . The user speaks into the microphone 50 on the earset communicator 10. 

2. The earset communicator 10 transmits the analog voice to the base station 20 over the 
airintaface 180. 

3. The base station transmits the analog voice to tiie microprocessor-based appliaace 30 
using a USB connection 21 . 

4. The USB audio driver 32 passes the voice to flie IP Client appUcation 34. 

5. The IP cUent application 34 converts the analog USB voice to IP voice packets. 

6. The client application 34 transmits the IP voice packets to the microprocessor-based 
appliance's 30 network interfece 36, such as a card or modem. 

7. The PC's network interface 36 transmits the BP voice packets over flie Internet 310. 

When the earset 1 0 user is listening: 

1. The PC's network interface 36 receives IP voice packets and passes them along to the 
IP client application software. 

2. The IP client appUcation converts the IP voice packets to analog voice. 

3. The USB audio driver 32 passes the analog voice to the base station 20 via a USB 
connection 21. 

4. The base station 20 passes tibie analog voice to flie earset commanicator 10 over the air 
inter&ce (i.e. using a wireless transmission) 180. 
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Corporate Voice over Network 

The use of VoIP in the coiporate environment results in a significant reduction in the 
cost associated with intra-office (branch to branch), and inter-bffice communications. The cost 
of intra-office communication can be broken down into: equipment, maintenance, and 
telephone charges. Equipment and maintenance costs are the primary areas of savings for 
inter-office communications. VoN technology can significantly reduce these costs in the 
following manner: 

Equipment - Generally speaking, VoN equipment is less expensive than traditional 
telephone equipment. Additionally, with VoN technology voice traffic travels over the same 
network infrastructure as data traffic meaning there is no need to purchase and maintain a 
completely separate network to handle voice. 

Maintenance - Because VoN technology utilizes the existing data network there is no 
need to maintain a completely separate voice network. Also, existing IS staff generally has the 
knowledge to support and maintain the existing data network so there is no need to hire and 
train duplicate staff to manage the voice communicadohs component. 

Telephone Charges - Because VoN communication technology uses the existing data 
network, there is no need to lease separate lines to handle voice traffic in the case that the 
branch offices each have connected telephone equipment. In the event each branch office is 
not connected, and is using service provided by a long distance carrier, the savings can be 
greater because all long distance charges for intra-office calls can be e^ninated. 

In a preferred anbodiment, the earset communicator system and software may be 
integrated with the offerings of VoN providers to add significant fimctionaHty including: 
voice agent c^abifity to create 'Tntelligent Dial Tone," voice dialing, voice access to all 
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telephony features (park, call, transfer, etc.) and voice mail, and integration with corporate 
contact management and collaboration systems (Microsoft Outlook, Lotus Notes, etc.). 

For example, the earset communication system preferably includes a VoN telephony 
system to provide a highly convenient, highly functional alternative to the soft phone 
(computer software) or telephone handset hardware. The earset communicator 10 preferably 
supports functionaUty with both, VoN and traditional voice solutions. The embodiments 
disclosed do not preclude working with standard telephone services. All the telephone 
fimctions described in this section apply to any transport medum, however the physical 
transport medium in the case of VoIP is based on the Internet Protocol. 

Consumer VoIP 

Another embodiment of the eaiset communication system provides IP Telephony in 
the consumer market to provide firee or greatly reduced cost of long distance and international 
telephone calls. One drawback consumers fece when using VoIP to make telephone calls is 
the fact that they are tied to Iheir computer in order to receive the lowest possible rates (PC to 
PC or PC to Phone calls). That is, they are forced to use soft phone ftmctionality via a 
graphical user interface suppUed by the VoIP provider. They also generaUy must use a 
speaker and microphone combination wired to the computer. 

However, since the earset communication system includes a wireless connection to a 
microprocessor-based appHance 30, through the base station 20, users can make Voff calls 
fiom anywhere in the home, allowing them to use the earset communication system in 
conjunction with a VoIP provider to make calls like they might olhawise make using a 
standard telephone handset Another key advantage that the earset communication system 
adds to the VoIP platform is voice dialing, making the process of initiating and answering IP 
telephony caUs extremely simple and convenient. Additional fimctionality accessible via the 
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eaiset commuiucation system software, such as voice mail, call screening, and unified 
messaging, round out the VoIP offering and make the complete solution an improvement over 
llie existing analog telq}hone. 

The primary service providers in the consumer VoIP market are demonstrating that the 
potential firom this technology is significant Some of the current Voff providers are: 
Net2Phone. (http://www.net2phone.com). PhoneFree flittD://www.p hnnefa^ r■nn^^, and 
DialPad fhttp://www.dialpaid.cn| w). 

To effectively use VoIP today, a consumer may utilize a high speed Ihtemet 
connection like DSL or a cable modem (standard 33k- 56k dialup will also work, although 
the voice quaUty may be somewhat less than that of standard telephone service). One of the 
primary problems with using VoIP is the feet that the user is tied to their computer - a 
problem that earset communication system neatly resolves. In addition to VoIP fimctions, 
additional capabitities that are enhanced by the earset communication system include voice 
chat for instant messaging, and voice-based command and control applications. 

Instant Messaging Users 

With approxhnately 45 million users of AmericaOnline's AOL Instant Messenger 
(AIM), and approximately 50 million ICQ users, plus the users of Yahoo! Messenger, MSN 
Messenger, and others, the instant messaging maricet consists of a substantial user base. Some 
of these instant messaging products siqjport voice conversation, while others only offer text- 
based chats. Today, aU of these services require that users be at tiieir oranputera to engage in a 
chat. Integrating the eaiset communication system mto these products allows users to initiate, 
respond to, and engage in a voice-based chat via the instant messaging software from 
anywhere in the home. Even without such integration, tiie eaiset communication system 
enables the users of instant messaging software tiiat supports voice conversations to do so in a 
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hands free manner while the user is moving freely throughout the home (although users will 
still have to initiate and answer the chat at the conq)uter). 

Tdephone Commnnication 

Telephone communication will now be described with reference to Figures 5A, 5B, 
7Aand7B. Figure 5A is a functional block diagram oftheearset communication system. As 
shown, the microprocessor-based appKance 30 includes an inteifece 21 for communicating 
with the earset 10 via the base station 20 and also includes a network interface 300 for 
couphng the earset 10 via the j^jpKance 30 to a network 80 that supports voice 
communication. Figure 5B shows that the network interface 300 may include one or more of: 
a network connection, such as a connection to a LAN, WAN, the Internet and the hke, and a 
connection to the PSTN, such as by a USB PSTN interfece 46 or PSTN Telephony Interface 
48. The software 31 shown in Figures 5A and 5B is further described in Figures 7A and 7B. 
The software modules shown in Figures 7A and 7B. other than the earset agent appUcation 
320, are weU known to those skilled m the art and are widely available. The preferred earset 
agent appUcation is commercially available as the Arid Voice Agent software, from 
ArialPhone LLC of Vemon Hills, Illinois. 

Figure 7A fiirther describes the preferred embodiment m which the microprocessor- 
based appUance 30 includes both a network interfece or NIC and a PSTN Telephony 
Interfece. In the alternative embodiment shown in Figui© 7B, the microprocessor-based 
appliance's 30 PSTN Telephony Interface is replaced by the USB PSTN interface, which is 
illustrated as residing in the base station 20. As used herein, the PSTN Telephony Interface 
may be a voice modem. Dialogic D/41BSC. PhoneRider by MediaPhonics type boards, 
Internet Phone Jack by QuickNet type boards, or the hke. 
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The network ipterfece card 342 provides the interface for VoN communication, as 
described above with reference to Figure 6. Such cards are readily available from 3Com 
Coiporation of Santa Qara. California, Intel Coip. of Santa Clara, CaUfomia and others, and 
provide fidl-duplex capabilities. This interface is not utiUzed for PSTN telephony. 

hi accordance with the preferred embodiment of Figure 7A, the PSTN Telephony 
Interface m the microprocessor-based ^hance 30 mclndes DTMF dialer circuitry that is 
c^le of diaUng a phone number transmitted from the microprocessor-based appUance 30 
viaitsmtemalbus. The PSTN Telephony Interface may include Caller ID detection circuitry 
that is capable of passing a caller's telephone number and test string to the microprocessor- 
based appUance 30 via its mtemal bus. In addition, the PSTN Telephony Interface preferably 
provides to the microprocessor-based ^pliance 30 audio I/O support of 16-bit, 8-KHz PCM 
formats: unsigned Imear. G.71 1 . Preferably, a four conductor RJ-1 1 jack may be used to 
couple the PSTN Telephony hiterface to a telephone hne. 

Preferably, the PSTN Telephony hiterface also has fiiU-duplex audio circuitry that is 
capable of takmg a first audio stream from the telephone Ime and placing it on the mtemal bus 
of the microprocessor-based appUance 30. The earset agent appUcation 320 m conjunction 
with the weU known device and media streaming drivers is capable of takmg the first audio 
stream from the mtemal bus and transmitting it to the earset 10 via the base station 20. In the 
same manner, the earset agent appUcation 320 is capable of placing a second audio from the 
earset 10 via the base station 20 onto the internal bus. The PSTN Telephony hiterface is 
capable of takmg the second audio stream from the mtemal bus andj>lpcmg it on the 
telephone Une. For fiiU-duplex communication, the first and second audio streams are 
processed shnultaneously m the earset communication system. 
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As the user speaks telephony control commands into the eareet 10, they are transmitted 
to the earset agent application 320 via the base station 20. In response, the eai-set agent 
appUcation 320 issues appropriate telephony control commands, such as on-hook, digit 
dialing, off-hook, flash, conference, mute and the like, to the PSTN Telephony Interface via 
the internal bus of the microprocessor-based ^pUance 30. In addition, the fuU-duplex audio 
processing will allow the earset agent appUcation 320 to record line or earset audio, and to 
communicate voice commands, play back PC audio to the line or earset 10. For example, the 
microprocessor-based ^liance 30 is able to send earset control codes to the base station 20 
to permit signalmg and prompting to the earset 10 to perform a specific function. 

In accordance with the alternate embodiment of Figure 7B, the base station 20 has 
DTMF dialer cuxiuitry (not shown) that wiD be enable of diahng a phone nnmbw transmitted 
from the microprocessor-based appliance 30 via the USB connection 21. The base station 20 
also may mclude Caller ID detection drouitry 23 that is capable of passmg a caller's telephone 
number and test string via the USB connection to the computer 30. hi addition, the base 
station 20 preferably provides to the microprocessor-based appUance 30 audio I/O support of 
16-bit, 8-KHz PCM formats: unsigned linear, G.71 1. In tenns of the telephone network 
interfece, the base station 20 includes a USB PSTN mterface 46. A four conductor RJ-1 1 jack 
may be used to couple the base station 20 via the USB PSTN interface 46 when connected to 
a telephone line. 

In one embodiment, the base station 20 also has full-duplex audio circuitry that is 
capable of communicating the audio stream provided via the USB Qomiection 21 to the 
microprocessor-based appUance 30. Usmg the USB connection 21, the microprocessor-based 
appHance 30 and base station 20 will communicate telephony control commands as well as 
fuU-duplex audio processmg. This allows the earset agent appUcation 320 via the 
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microprocessor-based appUance 30 to control fimctions such as on-hook, ofiF-hook, flash, 
conference and mute. In addition, the M-duplex audio processing wiU aUow the earset agent 
application 320 to record line or earset audio, and to communicate voice commands, play 
back PC audio to the line or earset 10. For example, the microprocessor-based appUance 30 is 
able to send earset control codes to the base station 20 to permit signaling and prompting to 
the earset 10 to perform a specific function. 

During a conversation between the earset 10 and the network 80, as shown in Figure 
5A, the microprocessor-based appliance 30 may send an audio message to the earset 10, for 
example to alert the user of a caU waitmg. The earset agent application 320 may 
communicate separately and simultaneously with both the local and remote parties when the 
parties are not communicating with each other. For example, the local party may perform an 
Internet look-up while the remote party receives a recorded music stream. In addition, where 
no one is available at the earset 10 (i.e. local user), the earset agent application 320 via the 
microprocessor-based appliance 30 may communicate with the remote party to prompt the 
remote party to leave a message. 

Vertical Market Applications 

The unique form factor of flie earset communication system provides significant 
support for vertical market solution providers to offer new, highly diffeentiable services. 
Examples of vertical market services include: 

Public Safety 

AppKcation to allow a pubhc safety officer to interview incident witnesses and 
automatically fill out forms and reports via a voice based interface using the earset 
communicator system . The ^Ucation may also allow the officer to make voice requests for 
information via a central conq)uter or the Internet. 
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UtUity.Workers 

Application to aUow utiUty workers to make voice requests for specifications on 
equipment that they are currently working on. The information from the central computer or 
the Internet is requested and delivered via tiie earset communicator system . 

Medical/Legal Service Providers 

AppUcation that aUows voice dictation of case/procedure notes. The iqjpKcation may 
also allow the service professional to request and retrieve information via the earset 
communication system. 

Software Interfaces 

In accordance with a preferred embodiment, voice commands are used for all 
functions and control of the system. When a user activates the command button to issue a 
command, the base station 20 routes audio picked up from the earset microphone 50 to the 
microprocessor-based appUance 30, where speech recognition is ^pUed to the input 
command signal and the command signal is processed. Speech recognition software on the 
microprocessor-based appliance 30 interprets the voice command as described in greater 
detail below with reference to the software flow figures. In accordance with a first 
embodiment, only commands are routed to the microprocessor-based appUance 30 and not 
audio during a conversation with another party. Once the user has issued the command to 
make a call, communication audio (i.e., the audio from a VoN conversation) is notpicked up 
by the earset agent application 320. The reason for this is that it is not practical for a number 
of reasons to have the speech recognition software Usten to an entire conversation. This is the 
reason for the voice command button ^ to notify the earset agent application 320 to expect a 
conunand. 
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Opwation of flie eaiset comnmnication system in accordance with apreferred 
embodiment will now be described. As shown in the flow diagram of Figure 9, the user at 
step 120 may depress the command button 110 on the earset 10 and, after receiving a ready 
prompt at step 130 from the microprocessor-based appKance 30, Ihe user may speak a 
command at step 140, such as "Call Mr. Williams," or "Open Microsoft Outtook," or "Close 
the kitchen blinds," or "What is the temperature outside?" Once the microprocessor-based 
appUance 30 has received flie voice command at step 160 and confinned the command at step 
180 or 190, then the system software initiates the q)pnq)riate action at step 210. 

If the earset system is making a caU, the connection to the network 80, shown in 
Figure 5A, preferably is muted while the command is issued and being responded to so the 
remote party does not hear Ihe command. Ifthe command was not recognized at step 170 or 
at step 220, then Ihe user may again be prompted or asked to start over at step 1 10 or at step 
140. 

Preferably, the system utilizes tiie Lemout & Hauspie speech recognition engine 
model # ASR 16G0/M, which requires no voice training, no names or numbers to enter 
(assuming that the user already has names and numbers recorded m a contact 
management/address book system like Microsoft Outlook, Lotus Notes, Windows address 
book, etc.), and no leaimng curve to go through. One skUled in the art may readily adapt any 
appropriate commercially available speech recognition engme. The voice recognition engine 
will also preferably support multiple or alternative languages for example. EngUsh, Spanish, 
German, Chinese, French, Japanese to naine a few. The system may use the names that 
already exist in the user's contact file, through a dynamic mterface to Microsoft Outlook, 
ACT, Lotus Organizer, and similar products. 
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The solhvare that operates the system may be aa appUcation based on the Microsoft 
Windows 98 or Windows 2000 operating system (or any subsequent release) and wiU 
preferably comply with the "Designed for Microsoft Windows" Logo program, to which those 
interestedmayrefer. For the preferred embodiment of Figure 7A, the system preferably 
includes an open hardware platform for multimedia playback and recording as weU as button 
press events. 

For the alternative embodiment of Figure 7B, the system preferably includes an open 
hardware platform for telephony utUizing Microsoft's Telephony API standard. This allows 
other third party software applications to operate the required system hardware. The system 
software application uses TAPI 2.0 specification to communicate with the system. The 
system may also use the TAPI 3.0 specification when available or future versions as they 
become available. 

Microsoft provides support for universal serial bus (USB) 21 using the Microsoft 
Win32 Driver Model (WDM). Hardware vendors who implement USB solutions for drivers 
can use the drivers provided by Microsoft or can create minidrivers to exploit any additional 
unique hardware features. Features requiring a driver that are beyond (he fimctionality of the 
basic USB audio driver include audio channeling, earset and base station control signaling, 
telephony control, and the voice command button feature. The base station 20 preferably is a 
"Plug and Play" device as defined by the Microsoft PC99 (or PC2000) System Design Guide. 

The voice agent (VA), also referred to herein as the eaiset agent appUcation 320, is a 
speech-based interfece agent used to interact with the hardware aniothfcr third-party devices 
and software systems. To accompUsh its fimction. the voice agent utilizes program logic, a 
speech recognition engine, pre-recorded voice files, and text to speech synthesis where 
necessary. The VA may use dedicated hardware or other TAPI comphant telephony devices 
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for its audio VQ and telephony control. In addition, third party hardware and software 
systems like Savoys CyberHome. IBM's ViaVoice and various home automation devices can 
also be controlled through the VA, 

As shown in figure IOC, the process for initiating the voice agent is by pressing the 
voice command button 110 on the earset 10 or base station 20 (speakerphone) to activate the 
VA at step 120. This activation plays the ready prompt at step 130 of figure IOC through the 
earset speaker 52 and places the VA in a listening state. The period of time for placing the 
VA m a Ustening state is a system configurable option: for example, 2 seconds. If no speech 
is detected, the system wiU revert to its previous state. Further details on the activation of the 
voice agent and the ready prompt are provided below with reference to the description of the 
various use cases. 

The ready prompt may consist of a user recorded audio stream (WAV file), a pre- 
selected appUcation-ofifered audio stream, or a simple combination of tones. The ready 
prompt will be an appHcation configurable variable. For example, llie ready prompt may 
consist of: 

"Yes Steve?" 

For purposes of this explanation, all voice command dialogs wiU assume the voice 
command button 110 has been pressed and the ready prompt has been played. 

In another preferred embodiment, the system is capable of answering the phone and 
asking the remote party their name and who Ihey are calling. The caU may then be announced 
through wired or wireless speakers located strategically around thehouse or office tiwt are 
coirtrolled by the microprocessor-based appUance 30 running the software so the residents 
know who should answer the phone, and who is calling. This feature can also be used for 
paging and general annoimcements. 
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For example flie software can screen oat telemarketing calls. Many telemarketers use 
predictive dialers, which are simply computer programs that dial phone numbers and wait for 
a human to answer the phone. Telemarketmg calls by telemarketers using predictive dialers 
are screened out automatically because Iheir predictive dialer software makes the 
determination that a person has not answered the telephone and hangs up. The system may 
also identify the caUer thus ehminating the need for CaUer ID. Individual speakers in each 
room can be selected by the user or automatically by the software so that people may be paged 
and people may join a conversation. In a home appUcation, the system may announce when 
vehicles have pulled into fhe driveway, when any door has been opened, when there are 
visitors at the front door and when mail has arrived. 

Telephony Service Provider (TSP) 

A telephony service provider is a dynamic-link library (DLL) that supports 
communications over a telephone network through a set of e3q)orted service functions. Tte 
service provider responds to a telephony request, sent to it by flie TAPI, by carrying out the 
low-level tasks necessary to communicate over the telephone network. In this way, the 
service provider, in conjunction with TAPI, shields applications from the service and 
technology dependent details of the telephone network communication. 

Each service provider is responsible for respondmg to telephony requests from TAPI 
to control lines and telephone devices. A service provider is also responsible for controlling 
and assessing the information exchanged over a call. To manage this information (called the. 
media stream), the service provider must provide additional capabiUties or fimctions. The 
System TSP may optionally have configuration options to interface with PBX commands. 
These configuration options define what the flash, park, transfer, conference, forward, etc. 
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commands equate to in terms of hook flash commands. For example, a conference command 
may consist of "flash *2". 

Issuing a Command 

The processing of a command issued via the earset device 10 will now be described 
with reference to the flow chart of figure 9. Figure 9, which is preferably implemented in 
software, depicts a preferred method for handling a command issued by a user. As shown in 
Figure 9, and fiirther described below, the method preferably includes the abiUty to handle 
recognition errors. It will be recognized upon review of the following that Figure 9 depicts a 
generalized method for issuing a command. Specific exanq)les of particular commands will 
be presented separately below. Figures 5A and 5B illustrate the audio signal paths within the 
earset communication system associated with the general method.described in Figure 9. 

As shown m Figure 9, initiation of the processing of a user command begins at step 
1 15, where the initial conditions of the earset communication system are as follows: (1) the 
microprocessor-based appUance 30 is powered on; 2) the base station 20 is connected to the 
microprocessor-based appUauce 30, such as via a USB port; 3) the base station 20 is 
powered on; and 4) a voice agent communication software q>plication is running on the 
microprocessor-based appliance 30. 

At step 120, the user presses the coimnand button 1 1 0 on the earset 1 0, shown in 
Figure 2, which causes Ihe earset 10 to transmit a signal to the microprocessor-based 
appHance 30, tbrongh the base station 20. Upon receipt at the microprocessor-based ^pUaace 
30, the signal activates the voice agent. As described above, the voice«gent is preferably a 
speech-based interfece agent used to interact with the system hardware and other third party 
software products, such as Microsoft Outtook. Lotus Notes, Lemout and Hauspie Voice 
Express, Dragon Dictate (firom Dragon Systems), VoIP capable software (Net2Phone, 
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DialPad, Microsoft NetMeeting), Instant Messaging Products (ICQ, AOL Instant Messenger, 
Yahoo! Messenger), or any other voice enabled applications or ^Ucations that could benefit 
from being voice enabled. A suitable, conmiercially available voice agent is the Aiial Voice 
Agent, offered by AiialPhone LLC of Vernon Hills, Illinois. In response to the signal, the 
micioprocessor-based appUance 30 issues a ready prompt at step 130 to the earset 10 and 
places the voice agent in a listening state for in a pre-configured manner. In apreferred 
embodiment, the ready prompt in the appUcation may be configurable in one of many user 
selectable ways. For example, the ready prompt may be an audio stream containing a 
message pre-iecorded by the user, a generic pre-selected audio stream offered by the 
appKcation software, or a simple earcon signal characterized by short bytes or tones that are 
associated with a specific event. 

In response to the ready prompt, the user may issue a verbal command at step 140. At 
step 150 the system determines whether the user spoke. If the user does speak, then the 
method proceeds to step 160, where voice recognition processing is performed on the 
command. If the system detects silence, i.e. the user does not speak, then the method . 
proceeds to step 152, where the user is re-prompted. In accordance with a preferred 
embodiment, the number of times that the user may be re-prompted is a configurable option. 
The preferred number of re-prompts, for usabiUty purposes, is 2 times total - i.e., initial 
command and 1 re-prompt More than tbis tends to finslrate users, however the number of re- 
prompts is configurable so more tolerant users can set it higher. In this case, Ihe system then 
determines whether the user has been re-prompted the predetermined number of times at step 
156. If the user has not yet been prompted the maximum number of times, then the method 
returns to step 140 so the user may issue a command. If, on the other hand, the user has been 
prompted the predetermined number of times, then the method proceeds to step 240, where 
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the user is infqnned of the feilure to recognize a command and then the system returns to step 
115. 

Returning to the case where the user does speak, at step 160 the voice recognition 
processor associated with the voice agent preferably returns recognition confidence level 
infonnation, which may be used to determine how accurately a phrase, in this case a 
command, was recognized. The speech recognition processor preferably assigns a confidence 
level to the spoken command and then sorts the assigned confidence level into one of three 
recognition quaUty categories: high confidence (for example, above 90% confidence), low 
confidence (for example, between 70%-90% confidence), and unrecognizable (for example, 
below 70% confidence). In the most favorable situation, the confidence in the speech 
recognition is high and the method proceeds to step 1 90 where the PC implicitly verifies the 
issued command and opens a recognizer. An impHcit verification is characterized in that Ihe 
user is not prompted to verbally confirm the command because of the high confidence in 
recognizing the spoken command. At step 195, the method determines whe&er the user has 
cancelled the confirmed command. If so, the method returns to step 130 where the earset 10 
plays the ready prompt to let the user know they can restate the command. If on the other 
hand, the user does not cancel the confirmed command at step 195, then the method proceeds 
to step 210 where the command is executed. 

If the confidence in flie speech recognition is, for example, between 70% and 90%, 
then the confidence is categorized as low at step 160, and the method proceeds to step 180, 
where the earset agent appUcafion 320 sends a command verification pompt to the user. For 
example, command verification may comprise repeating the command and asking the user to 
verbally confirm its accuracy. Specifically, the user may hear through the speaker on the 
earset, "Did you say 'call John Doe'T" At step 200, the method determines whether the user 
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replies afBrm^tively to the command verification prompt. If so, then the method proceeds-to 
step 210 and the command is executed. If, however, the user does not reply affirmatively to 
the command verification prompt at step 200, then the reply is characterized as 
unrecognizable, and the user is re-prompted, at step 220, for a command. The number of 
times to re-prompt the user is preferably a configurable option. Silence by the user during the 
configurable response period may be treated as an unrecognizable response at step 220. If the 
user has been re-prompted the predetermined number of times without resulting in an 
affirmative response, then tiie method proceeds to step 240. If the user has not been re- 
prompted tiie predetermined number of times, then the method returns to step 180. 

If the spoken command is unrecognizable based, for example, on a less tiian 70% 
confidence in recognition of the command at step 160, then the metiiod proceeds to step 170, 
where the user is re-prompted, preferably repeatedly for a predetermined number of times. 
Once the user has been re-prompted the predetermined number of times, as determined at step 
172, without the voice agent receiving a recognizable command, then the user is infomed at 
step 176 of a failure to recognize the command, and the method returns to step 1 15. As noted 
above, the number of repetitions is preferably a user configurable option. If the user has not 
been re-prompted flie predetermined number of times, tiien the method proceeds from step 
172 to step 140 and the system awaits the user's command. 

Making A CaU 

Turning now to specific examples of particular commands, a preferred embodiment of 
the present invention allows the user to place a call using tiie earset commumcation system. 
Figure 10 is a flow chart aiustiating the basic course for making a call. Beginning at step 250, 
the user requests the voice agent. This corresponds to steps 1 15, 120 and 130 in Figure 9. 
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Next, step 260, the user issues a command in a predetermined form to indicate to the 
communication system the user's desire to place a caD. Preferably, the earset agent 
application 320 recognizes synonyms for commonly used commands. For example, the "call" 
command may be recognized whether the user says "call", "dial" or "get me." Preferably, the 
generalized method of Figure 9 is followed in regard to recognition rates and the process in 
the event that the command is not recognized. The actual command may request that the 
system call a person at a particular location. For example, the user may use a command, "CaU 
Steve Smith at Work." At the microprocessor-based appliance 30, the voice agent will 
therefore process the command for recognition of 1) the type of command, such as a caU; 2) 
the person to call; and 3) the location. 

Once the command has been recognized, the method proceeds to step 270, where the 
voice agent looks up the caUed party's number, such as an IP address or telephone number, at 
the requested locatioa Generally, the user's contacts are stored in memory at the 
microprocessor-based appUance 30. For example, the microprocessor-based appUance 30 
may include a software application for storing and accessing contact information. There are 
numerous software plications that are suitable for this purpose including, for example, 
Microsoft Outlook, which is available from Microsoft Corp of Redmond, Washington and 
Lotus Notes, which is avaUable from Lotus Development Corporation of North Reading, 
Massachusetts. Step 270 and the following steps of Figure 10 conespond to the step 210 of 
Figure 9. 

The voice agent tiien confirms the command to call the called party at step 280. For 
example, the voice agent impHcitly confinns the users request by stating to the user, "Calling 
Steve Smith at Work." Iftheusei does not cancel the confirmed command, the method 
proceeds to step 290, where a call is placed to the called party at the desired location. If 
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however an ej^pUcit confinnation is required, for example where the confidence in the speech 
recognition of Ihe comnaand is Low Confidence or Unrecognizable, then the method 
preferably proceeds along the paths of steps 180 or 170, respectively, in Figure 9. Reference 
may be made to the flow chart in Figure 9 for fiather detail regarding command confimiation. 
Agam, once the command is confirmed, either impUcitly or expressly, the method proceeds to 
step 290 for execution by placing the call. 

Making A Call - Alternative Course 1 
The flow chart in Figure 1 1 A shows an alternative method for placing a call using the 
earset communication system. The method shown in Figure 1 1 A generaUy foUows the 
method of Figure 10, except that the system checks that the requested location for aparticular 
person being caUed is valid. Thus steps 250 and 260 are the same in Figure 10, except that 
the method of Figure 1 1 A requires the user to specify a location for the called party. This 
method is necessary where a called party has multiple phone numbers designated by a unique 
location such as home or work. Likewise, steps 280 and 290 are present m both 
embodiments. 

Following the user's issuance of a command to call the called party at a particular 
location at step 260, the method of Figure 11 A proceeds to step 305, where the system 
deteraiines whether the requested location is vaUd. Generally, a requested location will be 
considered vaKd if the user's contact information includes a number for the called party at the 
requested location. 

If the requested location is vaUd, then the method proceeds to step 310, where the 
voice agent detemtunes the called party's number at the requested location. From there, the 
method proceeds to place the call to the called party at the requested location in accordance 
with steps 280 and 290, which are described above. If, on the other hand, the requested 
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location is invpUd, ti^eti the method proceeds to step 325, where the voice agent informs the 
user that the location is not vaKd. For example, the voice agent in step 325 may respond with: 
"Thaf s not a vaUd location; you can say Docation_l]. . .[Location_n]," where 
DocationJ], . .[Location_n] correspond to the valid locations associated with the called party. 
Alternatively, in the event where there is no number defined for a requested location, the 
system may prompt the user to enter one. Since each called party may have numerous 
numbers corresponding to different locations, for example, home, work, mobile and the like, 
the system will preferably mfomi the user of each vaUd location. Next, the user responds with 
the desired location at step 335. The method then retmns to step 305 in order to detennine if 
the location is vaUd. Once the location infomiation is determined to be vaUd at step 305, then 
the method proceeds with steps 310, 280 and 290 as described above. 

Making A Can - Alternative Course 2 

The flow chart m Figure 1 IB shows another alternative embodiment of the method 
for placmg a caU using the earset cominunication system. The initial steps are similar to the 
initial steps in Figure 1 1 A, except that in Figure 1 IB the user command at step 260 includes 
only the caUedpaxty's name. The method proceeds to step 345, where the system determines 
whether there is more than one number assigned to the caUed party's name. If more than one 
number is assigned to the called party's name, then the method proceeds to step 355, where 
the voice agent prompts the user for more mformation, such as by requesting "At which 
location?" 

At step 365, the user will respond to the prompt by speaking the location desned 
for the caUed party. The system then determines, as described above with reference to Figure 
IIB. whether the location specified by the user is vaHd at step 305 and the method progresses 
as described with reference to Figure 1 IB. 

Returning to step 345, if the method detennines that there is only one number for 
the called party, then the method proceeds to step 375, where the voice agent determines the 
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proper number fixmi the user's contact infonnation. The method then proceeds to steps 280 
and 290, which are described above, to complete placement of the call to the caUed party. 

Retrieving Schedole Information 

A preferred embodiment of the present invention allows the user to retrieve schedule 
infonnation using the earset communication system. Figure 12 is a flow chart illustrating the 
basic course for retrieving schedule information . Begimiing at step 250, the user requests the 
voice agent Thiis corresponds to steps 115, 120 and 130 in Figure 9. 

Next, at step 262, the user issues a command in a predetennined form to indicate to the 
communication system the user's desire to retrieve schedule information. Preferably, the 
generalized method of Figure 9 is foUowed with regard to recognition rates and the process in 
the event that the command is not recognized. The actual command may request for a 
description of the user's schedule. For example, the user may use a command, "What is my 
schedule today?" At the microprocessor-based appUance 30, the voice agent wiU therefore 
process the command for recognition of the user's schedule. 

The voice agent then confiims the command to retrieve schedule information for today 
at step 282. For exan^le, the voice agent in^Ucitly confirms the users request by stating to 
the user, "Retrieving schedule information for today." If the user does not cancel the 
confirmed command, liie method proceeds to step 288, where the schedule information is 
retrieved. If however an explicit confirmation is required, for example where the confidence 
in the speech recognition of the command is Low Confidence or Unrecognizable, then the 
method preferably proceeds along the paths of steps 180 or 170. respectively, in Figure 9. 
Referemje may be made to the flow chart in Figure 9 for fiirther detail regarding command 
confirmation. Again, once the command is confirmed, either implicifly or expressly, the 
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meftod proce^ to step 288 for retrieving flie schedule. Note that the user can interrupt and 
issue commands such as "next item", "previous item", "next day", "cancel", etc. 

Once the command has been recognized, the method proceeds to step 288, where the 
voice agent looks up the user's schedule. Generally, the user's schedule is stored m memory 
at the microprocessor-based appUance 30. For example, the microprocessor-based sqjpliance 
30 may include a software application for storing schedule faifonnation. There are numerous 
software appUcations that are suitable for this purpose iucluding, for example, Microsoft 
Outlook, which is available from Microsoft Coip of Redmond, Washington and Lotus Notes, 
which is available from Lotus Development Corporation of North Reading, Massachusetts. 
Finally, at step 292, the voice agent reads or plays the requested schedule information to the 
user based on the user's previous command. 

Control of Home Entertainment and Home Automation 
In another preferred embodiment, the earset communicator system functions with 
existing home control and home entertainment applications that rely heavily on devices such 
as remote controls and PC-based software interfaces to control various home fimctions. 
Implementing voice-based command and control of home fimctions using the earset 
communicator system greatly improve convenience and simphcity to the control of the home. 
Existing IR remote control units are limited to line-of-sight operation and require multiple 
button sequences to be learned and pressed for most operations. The earset communication 
system works from anywhere in the home and can respond to natural language commands, 
such as *Tut onESPN", Functions that may be undervoice control include: Television, 
Digital Music, DVD, Gaming, Lighting, HVAC (Heating Ventilation Air Conditioning), 
Motorized Blinds and the like. 
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Figure,13 is a flow chart illustrating the steps for the control of home entertainment 
functions. Figure 14 is a flow chart illustrating the steps for the control of home automation 
functions. The steps referenced below refer to software flow diagrams of both figures 13 and 
14. Beginning at step 250, the user requests the voice agent This corresponds to steps 115, 
120 and 130 in Figure 9. 

Next, at step 264 in Figure 13, the user issues a command in a predetermined form to 
indicate to the communication system the user's desire to control a home entertainment 
device. For example, the command may request that the TV be tuned to a particular channel 
as shown in Figure 13. Preferably, the generaUzed method of Figure 9 is followed with regard 
to recognition rates and the process in the event that the command is not recognized. The 
voice agent then implicitly confirms the command to control or adjust the home entertainment 
device at step 284. For example, the voice agent implicitly confirms the users request by 
stating to the user, "Tuning TV to ESPN" for the control of the TV. If the user does not cancel 
the confirmed command, the method proceeds to step 294, where upon execution of tiie 
command, the home entertainment device is controlled in the manner' commanded by the user. 
Reference may be made to the flow chart in Figure 9 for fimher detail regarding command 
confirmation. 

The generalized flow chart shown in Figure 14 illustrates the software flow for 
adjustment or control of a home automation function. The format of the generalized home 
automation command may be <adjustment or conti:ol> of the <home automation function> 
where the item in the <field> indicated is a command variable. The user jssues such a 
command at step 266 in Figure 14. The actual home automation fimction may be, for 
example, to lower the kitchen blinds. The voice agent software running on the 
microprocessor-based appliance 30 will therefore process the command for recognition of the 
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command and.for identification of the appliance to be controlled. At step 286 the voice agent 
confirms tiie command. If the user does not cancel the confinned command, the method 
proceeds to step 296, where upon execution of the command, the appliance is contix>lled in the 
manner commanded by the user. If however an explicit confimation is required, for example 
where the confidence in the speech recognition of the command is Low Craifidence or 
Unrecognizable, then the method preferably proceeds along the paths of steps 1 80 or 1 70, 
respectively, in Figure 9. Reference may be made to tiie flow chart in Figure 9 for fiirther 
detail regarding command confirmation. Again, once tiie command is confirmed, either 
implicitly or expressly, and the method proceeds to step 296 for execution of the command. 
Once the command has beai recognized, the method proceeds to step 280, where the 
appliance is adjusted in the desired mannffl:. 

As an alternative to the foregoing, tiie earset communication systan may be embedded 
with a home appKance or home entertainment device, provided that the appliance or device 
includes a read-only memory (ROM) stinctine, a random access memory (RAM) stnictiire, 
associated data and address buses, and a port for coupling the appliance or device to tiie base 
station 20. One skilled in flie art will readily adapt control of appliances to other home 
automated appliances such as home conti-oUer links witii actuators for curtains, blinds, lights, 
garage door openers, video cameras, TVs and intercoms to name just a few possible home 
appliances that may be valid appliances in tiie <iome automation fimction> field. The 
<adjustment or conti:ol> field for example could be an on/off operation, up/down volume, 
open/close, or otiier change mode fimctioa , 
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We claim: 

1 . A method of communicating with a telecommunications system, through a 
microprocessor-based ^pliance having a memory structure, over a wireless 

5 link, comprising the steps of: 

receiving an audio signal at a receiver; 
transmitting an audio signal at a transmitter; 

processing the transmitted audio signal to recognize a command; and 
controlling the microprocessor-based appliance to effect a desired mode of 
10 communication on the telecommunications system based on the command. 

2. The method of controlling the microprocessor-based appliance as claimed in 
claim 1, wherein the mode of communication is a VoN call. 

3. The method of controlling the microprocessor-based appliance as claimed in 
15 claim 1, wherein the mode of communication is a telephone caU. 

4. A wireless communication system communicating with a telecommunications 
network comprising: 

an earset having a transmitter and receiver; 

a base station having a transmitter and receiver in communication with 
20 the earset; 

a microprocessor-based appliance, having a memory structure, 
connected to the base station; and 

a speech processing program in the memory and executable by the 
appliance, said speech processing program associated with the control of a 
25 communication system, 

5. The apparatus as claimed in claim 4, wherein the earset comprises a single 
control button that when depressed alerts the speech processing program that 
an immediately following audio stream is to be interpreted as a command. 

30 6, The apparatus as claimed in claun 4, where the connection between the base 

station and the naicroprocessor-based appliance is through a universal serial 
bus port 
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7. . The apparatus as claimed in claim 4, wherein the microprocessor-based 
q)pliance further comprises a network intotface, 

8. The apparatus as claimed in claim 4 where the earset further, comprises: 

an antenna connected to the transmitter ; 
5 a microphone coupled to the transmitter; 

a receiver cormected to the antenna; and 
a speaker coupled to the receiver, 

9. A method of communicating in a user-communication interface over a wireless 
link between an earset and a base station comprising the steps of: 

10 transmitting an audio signal into a modulated carrier; 

receiving the modulated carrier to produce an audio signal; 

sending a command signal from the earset to the basestation; 

performing voice recognition on the audio signal after receiving the 
conunand signal; and 
1 5 sending control signals based on a voice command. 

10. An earset communication system that provides a user with a hands-free 
interface for communication and control functions, comprising: 

a hghtweight wireless earset having a speaker, a microphone and a 
radio transceiver coupled to the speaker and microphone; 
20 a microprocessor-based apphance having a network interfece and 

software for communication and control of a plurality of subsystems based on 
speech recognition capabilities; and 

a base station having a first interface for communicating with the earset 
over a radio link and a second interface for commimicating with the 
25 microprocessor-based appliance; 

wherem the user speaks a command into the microphone and the 
microprocessor-based appliance drives a selected subsystem to execute the 
command. " ' * 

11. The system of claim 1 0, wherein the command causes the microprocessor- 
30 based appliance to initiate a VoN call with a predetermined remote party. 

12. The system of claim 10, wherein the command causes the microprocessor- 
based appliance to initiate a PSTN telephone call. 
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13. , The system of claim 10, wherein the command causes the microprocessor- 

based appliance to adjust a home entertainment appliance. 

14. The system of claim 10, wherein the command causes the microprocessor- 
based appliance to accept voice dictation. 

5 15. The system of claim 1 0, wherein the command causes the microprocessor- 

based appliance to adjust an automated home appliance. 

16. A method for a microprocessor-based appUance to communicate with an 
earset, the method comprisiag: 

receiving a voice signal from the earset, the voice signal comprising a 
10 command; 

recognizing the coiranand in the voice signal; and 
performing a function in response to recognizing the command 

17. The method of clafan 16, wherein the microprocessor-based ^pliance is 
comiected to a communications network, the method further comprising 

15 sending a commimications signal to the communications networic. 

1 8 . The method of claim 1 6, wherein the function is a VoN call. 

19. The method of claim 1 6, wherein the function is a PSTN call. 

20. The method of claim 16, wherein the function is adjustment of a home 
appliance. 

2-0 21. The method of claim 16, wherein the function is voice dictation. 

22. The method of claim 16, wherein the function is adjustment of a home 
entertaiimient appliance. 

23. A method for a base station to communicatively couple an earset with a 
^ microprocessor-based appliance, the method comprising: 

25 receiving a voice signal from the earset, the voice signal comprising 

representation of a command; 

digitizing the voice signal into a digitized si^ai; and 

sending the digitized signal to the microprocessor-based appliance. 

24. The method of claim 23, wherein digitizmg the signal comprises: 

digitizing the voice signal into an intermediate digitized signal; 

converting the intermediate digitized signal into an analog signal; and 
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, digitizing the anriog signal into the digitized signal^ 

25. The method of claim 23, wherein receiving the voice signal from the earset 
comprises receiving a 900 MHz digital spread spectrum signal, 

26. The method of claim 23, wherein sending the digitized signal to Ihe 
5 microprocessor-based appUance comprises sending the digitized signal to a 

Universal Serial Bus port. 

27. A metliod of sending a command from an earset to a microprocessor-based 
appliance, the earset having a microphone and a command button, the method 
con:4}rising: 

10 activating the command button on the earset; 

receivmg a voice signal from the microphone, the voice signal being 

representation of a command; and 

sending the voice signal to the microprocessor-based device. 

28. The method of claim 27 further comprising (i) prompting the microprocessor- 
15 based device for receipt of the command; and (ii) receiving a ready prompt 

from the microprocessor-based appUance. 

29. The method of claim 27, wherein activating the command-button comprises 
depressing the command button. 

30. The method of claim 27, wherein receiving the voice signal further comprises 
20 performing noise cancellation on the voice signal. 

31. A microprocessor-based appUance for communicating with an earset, the 
noicroprocessor-based appUance comprising: 

a processor; 
a memory; 

25 computer instructions stored in the memory and executable by the 

processor for: 

recognizing a command in a voice signal received from the 
earset, the voice signal comprising representation of the command; and 
perfomiing a function on the microprocessor-based appUance in 

30 response to recognizing the command. 
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32. ,The paicroprocessor-based appliance of claim 31 furtho: comprising a 
communications interface for connecting the microprocessor-based appliance 
to a communications network. 

33. The microprocessor-based appliance of claim 32, wherein the communications 
5 interface is selected from the group consisting of a PSTN interface, a network 

interface, a Universal Serial Bus port, and a radio link, 

34. The microprocessor-based ^phance of claim 31, wherein a radio link 
communicatively couples the earset with the microprocessor-based apphance. 

35. The microprocessor-based ^pliance of claim 31, wherein the radio link is a 

1 0 900 MHz digital spread spectrum transceiver. 

36. The microprocessor-based appUance of claim 31, wherein the function is a 
VoNcaU. 

37. The microprocessor-based appliance of claim 31, whereui the function is a 
PSTN call. 

15 38. The microprocessor-based appliance of claim 31, wherein the function is 

adjustment of a home appliance. 

39. The microprocessor-based appliance of claim 3 1, wherein the function is voice 
. dictation. 

40. The microprocessor-based appliance of claim 31, wherein the function is 
20 adjustment of a home entertainment appliance. 

41. The microprocessor-based appliance of claim 31, wherein a voice agent 
facilitates performing the fimctioiL 

42. A base station for communicatively coupling an earset with a microprocessor- 
based appliance, the base station comprishig: 

at least one communications interface for (i) receiving a voice signal 
from the earset, the voice signal con^rising a representation of a command; (ii) 
sending a digitized signal to the microprocessor-based appliance; and 

circuitry for digitizmg the voice signal received from the earset into a 
digitized signal. 

30 43. The base station ofclaim 42 further comprising circuitry for: 

(i) digitizing the voice signal into an intermediate digitized signal; 

(ii) converting the intermediate digitized signal into an analog signal; and 
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. (iii) digitizing the analog signal into the digitized signal. 

44, A base station for communicatively coupling an earset to a microprocessor- 
based appliance, tihie base station comprising: 

a processor; 

5 a memory; 

at least one communications interface for (i) receiving a voice signal 
from the earset; and (ii) sending a digitized signal to the microprocessor-based 
appliance, and; 

computer instractions stored in the memory and executable by the 
10 processor for digitizing the signal received from the earset into the digitized 

signal. 

45 . The base station of claim 44 furflier comprising computer instructions stored in 
the memory and executable by the processor for: 

digitizmg the voice signal into an intermediate digitized signal; 
15 converting the intermediate digitized signal into an analog signal; and 

digitizing the analog signal into the digitized signal 
. 46. The base station of claim 44, further comprising mating contacts for charging a 
battery mounted in the earset. 

47. The base station of claim 44, wherein the at least one communications interface 
20 for receiving the signal from the earset comprises a 900 MHz spxGad spectrum 

transceiver. 

48. The base station of claim 44, wherein the at least one communications interface 
for sending the digitized signal to the microprocessor-based appliance 

— comprises a Universal Serial Bus port. 

25 49. An earset for communicating with a microprocessor-based appliance, the earset 

coniprising: 

a command button for pron5)ting the microprocessor-based appliance 
to receive a command; 

a speaker for generating audible sound received from flie 
30 microprocessor-based appliance; 
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, a microphone for receiving vocal sound, the vocal sound being a 
command; and 

a communications iaterface for sending a signal to the microprocessor- 
based appKance, the signal comprising a representation of the vocal sound. 

The earset of claim 49, further comprising an audio transducer for generating a 
notice sound. 

The earset of claim 49 further comprising a communications interface for 
receiving the audible sound from the microprocessor-based appliance, the 
audible sound being a ready prompt 
The earset of claim 49 fijrfher comprismg a battery. 

The earset of claim 49 ftirth^ comprising mating contacts for charging the 
battery. 

The earset of claim 49 wherein the earset comprises a processor and a memory, 
the earset forther comprising computer instructions stored in memory and 
executable by a microprocessor for performing noise cancellation on the vocal 
sound. 

The earset of claim 49, wherein the communications interface for sending a 
signal to the microprocessor-based q)pliance comprises a 900 MHz digital 
spread spectrum transceiv^. 

The earset of claim 49, wherein the communications interface for receiving 
audible sound from the microprocessor-based appUance comprises a Universal 
Serial Bus port. 

The earset of claim 49 further comprising a jack for connecting a separate 
speaker. 

The earset of claim 49 further comprismg a jack for connecting a separate 
microphone. 
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